Vectors on the Unit 3-Sphere with i/f f rjj ~ 1 ~ **

The integrand f(s) for n = 20 and d = 1 versus s. The inflection


6.2 Minimum, average, and maximum numbers of inverse iterations to
compute accurate eigenvectors for fifty random matrices of order
100 and five random matrices of order 500. The matrices have


for each eigenvector computation.


experiments. 


2.2  Measures  of  the  Quality  of  a  Method 
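
Two measures that are widely used to judge a computed eigendecomposition are the size of the residuals and the departure of the computed eigenvectors from orthogonality. The sketch below shows one way they might be computed for a symmetric tridiagonal matrix. The function names, the storage of the matrix as a diagonal list `diag` and an off-diagonal list `off`, and the unscaled form of both measures are assumptions made for illustration; the text may normalize them differently.

```python
import math


def tridiag_matvec(diag, off, x):
    """Multiply the symmetric tridiagonal matrix (diag, off) by the vector x."""
    n = len(diag)
    y = [diag[i] * x[i] for i in range(n)]
    for i in range(n - 1):
        y[i] += off[i] * x[i + 1]
        y[i + 1] += off[i] * x[i]
    return y


def max_residual(diag, off, eigenvalues, eigenvectors):
    """Largest 2-norm of T u - lambda u over all computed eigenpairs."""
    worst = 0.0
    for lam, u in zip(eigenvalues, eigenvectors):
        tu = tridiag_matvec(diag, off, u)
        r = math.sqrt(sum((tu[i] - lam * u[i]) ** 2 for i in range(len(u))))
        worst = max(worst, r)
    return worst


def max_orthogonality_error(eigenvectors):
    """Largest entry, in magnitude, of U^T U - I for the computed vectors."""
    worst = 0.0
    m = len(eigenvectors)
    for i in range(m):
        for j in range(i, m):
            dot = sum(p * q for p, q in zip(eigenvectors[i], eigenvectors[j]))
            target = 1.0 if i == j else 0.0
            worst = max(worst, abs(dot - target))
    return worst
```

For the 2 x 2 matrix with diagonal (2, 2) and off-diagonal 1, the exact eigenpairs (1, (1, -1)/sqrt(2)) and (3, (1, 1)/sqrt(2)) give values at roundoff level for both measures.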


Proof. Let Postmultiplying equation (2 by dyXT, then premultiplying it by Y^T, gives


no 


and,  upon  completion,  colun 


time required to normalize t/( and pass it through the communication


used  to  further  improve  the  performance  of  paral 


The recursive nature of Cuppen's method suggests its suitability to implementation
on a hypercube following the divide and conquer strategy described in


More precisely, during step 0, processor $P_i$ (a 0-cube) computes the eigensystem
$(\Lambda_{0,i}, X_{0,i})$ of the matrix $T_{0,i}$. Because each rank-one updating step requires
the eigensystems of two smaller matrices, processors must pair up in step 1 and
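
The pairing of processors can be pictured with binary processor numbers. The sketch below assumes, as an illustration only, that the processors of the hypercube are numbered from 0 and that the j-cube working together at step j consists of the processors whose numbers agree in all but their lowest j bits. The function names are hypothetical, and the numbering used in the text may differ.

```python
def subcube_members(rank, step):
    """Processor numbers in the step-dimensional subcube that contains `rank`."""
    base = (rank >> step) << step          # clear the lowest `step` bits
    return list(range(base, base + (1 << step)))


def partner(rank, step):
    """Processor that `rank` exchanges data with when two (step-1)-cubes merge."""
    if step < 1:
        raise ValueError("no pairing takes place at step 0")
    return rank ^ (1 << (step - 1))        # flip bit step-1
```

Under this numbering, processor 5 works alone at step 0, shares a 1-cube with processor 4 at step 1, and belongs to the 2-cube {4, 5, 6, 7} at step 2.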


For simplicity, denote these processors $P_0, P_1, \ldots$, and replace subscripts of the form $(j-1, 2i)$ with 0 and $(j-1, 2i+1)$ with 1. At step $j$, with the new notation, processors $P_0, P_1, \ldots$ compute the eigensystem $(\Lambda, X)$ of

(c) Permute the elements of $z$ accordingly: $z \leftarrow \Pi z$.

(d) Apply the product of plane rotations $G$ to zero out the elements in $z$ that correspond to close elements in $D$.
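
Zeroing the entries of z that pair with close elements of D can be sketched with plane (Givens) rotations. The version below is a minimal illustration under stated assumptions: `d` holds the diagonal of D in nondecreasing order, `tol` is a hypothetical closeness tolerance, and the function names are invented for this example. In a full implementation the same rotations would also be applied to the corresponding eigenvector columns, and the small off-diagonal term they introduce in D would be neglected.

```python
import math


def rotate_out(z, i, j):
    """Apply a rotation in the (i, j) plane that zeros z[j].

    Returns the cosine and sine so the rotation can be reused elsewhere.
    """
    r = math.hypot(z[i], z[j])
    if r == 0.0:
        return 1.0, 0.0                    # both entries already zero
    c, s = z[i] / r, z[j] / r
    z[i], z[j] = r, 0.0                    # [c s; -s c] maps (z_i, z_j) to (r, 0)
    return c, s


def deflate_close_elements(d, z, tol):
    """Zero z[j] whenever d[j] lies within tol of the last retained d[i].

    Assumes d is sorted in nondecreasing order. Returns the list of
    (i, j, c, s) rotations that were applied, modifying z in place.
    """
    rotations = []
    i = 0
    for j in range(1, len(d)):
        if abs(d[j] - d[i]) <= tol:
            c, s = rotate_out(z, i, j)
            rotations.append((i, j, c, s))
        else:
            i = j                          # d[j] starts a new group
    return rotations
```

Each rotation preserves the 2-norm of z, so the rank-one update keeps the same size while one of its entries drops out of the computation.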


matrix order    fraction of time in communication


computation than it is to broadcast the results. (The application of rotations
in step j.2(d) of Algorithm 5.11 is an example.) The communication required
for the hypercube similarly reduces the time savings due to deflation. Figure
5.2 and Table 5.3 show the fraction of time spent in communication on a 5-cube
for various orders of matrix [1,2,1]. Computation time is shown to be greater


For the model problem on a 5-cube, the ratio of communication time to computation
time falls from U2 at order 32 to 3 x 10 * at 1024. The fraction of
communication time shown for [1,2,1] in Figure 5.4 is greater than the predicted
value for because the eigenvalue computation takes longer for ImMUi As

shown in Figure 5.5, orthogonalization does not greatly alter the run time for


eigenvalues. 
Assuming that Steps 2 and 3 are carried out using stable, accurate methods,


in order. Tables 6.8 and 6.9 show maximum residuals R and orthogonalities O
taken over the tested matrices. The first values of R and O were measured after
the first iteration where all iterates have norm greater than one; the second values
of R and O were measured after the second iteration where all iterates have norm
greater than one. For all tested matrices, these were consecutive iterations. For


first iteration    2.41d-12    5.14d-11


result. It does, however, substantially increase the time required for the solution.
When the TINVIT criterion is used, most of the eigenvectors are reorthogonalized
(80% when n = 105 and %% when n = 525), but reorthogonalization occurs in
many small clusters. Thus, when all eigenvectors are reorthogonalized as one
group, the cost rises markedly although the accuracy is little changed. The same
For this analysis of inverse iteration, it is assumed that the starting vector is
created in the following way. A vector $x_0$ is first generated with independent
random components each having a normal distribution with mean 0 and vari


meant by a “good” approximation to an eigenvector or to a linear combination
of eigenvectors. Equivalent geometric and analytic definitions follow, along with
examples in three dimensions.

Let $y_0 = \sum_{i=1}^{n} \eta_i u_i$ satisfy $\|y_0\| = 1$. $y_0$ is considered a good approximation
to $u_i$ if $\eta_i$ is much larger than any other component, i.e., if $\eta_i^2 > 1 - \epsilon^2$ for some
tiny error tolerance $\epsilon$. Similarly, $y_0$ is nearly a linear combination of eigenvectors


cone around $u_3$ having vertex at the center of


[54] to estimate -y-). The mean value theorem for definite integrals states
that if the integrand $g(t)$ is continuous over $[t_1, t_2]$, there exists a number $t_3$ such
that




the number of quadrature nodes is chosen so


linearly independent. In this section, the analytic convergence of inverse iteration in any eigenvector direction


Table 7.8: Properties of some distributions.


exanipl 


8.5  discusses  the  parallelism  of  the  methods 


'll  3  ('£/  n) 


these deflation rules [46].


matrix | order | time for | time for | time for


be reorthogonalized independently. Similarly, parallel speedup of about ~p can


almost no deflation but that the speedup falls to about |p for problems with a
moderate degree of deflation. Even lower speedups can be expected for problems
where significant deflation occurs. A similar result can be anticipated for PSVD.

The experiments of Section 8.4, however, show that deflation is less prevalent
for PSVD than for TREEQL. Thus, a generally higher speedup could be expected